Algorithms for computing strategies in two-player simultaneous move games

Authors

  • Branislav Bosanský
  • Viliam Lisý
  • Marc Lanctot
  • Jiri Cermak
  • Mark H. M. Winands
Abstract

Simultaneous move games model discrete, multistage interactions where at each stage players simultaneously choose their actions. At each stage, a player does not know what action the other player will take, but otherwise knows the full state of the game. This formalism has been used to express games in general game playing and can also model many discrete approximations of real-world scenarios. In this paper, we describe both novel and existing algorithms that compute strategies for the class of two-player zero-sum simultaneous move games. The algorithms include exact backward induction methods with efficient pruning, as well as Monte Carlo sampling algorithms. We evaluate the algorithms in two different settings: the offline case, where computational resources are abundant and closely approximating the optimal strategy is a priority, and the online search case, where computational resources are limited and acting quickly is necessary. We perform a thorough experimental evaluation on six substantially different games for both settings. For the exact algorithms, the results show that our pruning techniques for backward induction dramatically improve the computation time required by the previous exact algorithms. For the sampling algorithms, the results provide unique insights into their performance and identify favorable settings and domains for different sampling algorithms.

Footnotes from the preprint: one author has a new affiliation (Google DeepMind, London, United Kingdom). Preprint submitted to Artificial Intelligence, April 6, 2016.
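To make the formalism concrete, the following is a minimal sketch of plain backward induction on a tiny two-player zero-sum simultaneous move game. It is not the paper's algorithm: it performs no pruning, and it assumes every stage has exactly two actions per player so that each stage reduces to a 2x2 matrix game with a closed-form value. The names `solve_2x2`, `game_value`, and the example trees are hypothetical and chosen only for illustration.

```python
def solve_2x2(a, b, c, d):
    """Value of the zero-sum matrix game [[a, b], [c, d]] for the row (maximizing) player."""
    maximin = max(min(a, b), min(c, d))  # best payoff the row player can guarantee with a pure action
    minimax = min(max(a, c), max(b, d))  # lowest cap the column player can enforce with a pure action
    if maximin == minimax:
        return maximin  # pure saddle point
    # No saddle point: both players mix, and the value has a closed form.
    return (a * d - b * c) / (a + d - b - c)


def game_value(node):
    """Backward induction: a node is a terminal payoff (number) or a 2x2 list of child nodes."""
    if isinstance(node, (int, float)):
        return float(node)
    # Solve the subgames first, then the matrix game formed by their values.
    (a, b), (c, d) = [[game_value(child) for child in row] for row in node]
    return solve_2x2(a, b, c, d)


# Illustrative two-stage game (payoffs are for the row player).
matching_pennies = [[1, -1], [-1, 1]]
subgame = [[3, 1], [0, 2]]
root = [[matching_pennies, 2], [subgame, -1]]

print(game_value(root))
```

Each stage is solved as a matrix game because neither player observes the other's current action; everything else about the state is known, which is what lets the values of subgames be substituted into the parent matrix. For larger action sets the 2x2 formula would have to be replaced by a general matrix game solver such as a linear program.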


Similar resources

The Complexity of Infinitely Repeated Alternating Move Games

We consider infinite duration alternating move games. These games were previously studied by Roth, Balcan, Kalai and Mansour [10]. They presented an FPTAS for computing an approximate equilibrium, and conjectured that there is a polynomial algorithm for finding an exact equilibrium [9]. We extend their study in two directions: (1) We show that finding an exact equilibrium, even for two-player z...


The Complexity of Alternating Move Games

We consider infinite duration alternating move games. These games were previously studied by Roth, Balcan, Kalai and Mansour [10]. They presented an FPTAS for computing an approximated equilibrium, and conjectured that there is a polynomial algorithm for finding an exact equilibrium [9]. We extend their study in two directions: (1) We show that finding an exact equilibrium, even for two-player ...


Strategy Improvement for Concurrent Safety Games

We consider concurrent games played on graphs. At every round of the game, each player simultaneously and independently selects a move; the moves jointly determine the transition to a successor state. Two basic objectives are the safety objective: “stay forever in a set F of states”, and its dual, the reachability objective, “reach a set R of states”. We present in this paper a strategy improve...


Investigating the Effect of Rewards on Individual Players' Efforts: A Behavioral Approach

The main goal of the study is to examine the effect of rewards on the behavior of players in a team activity. In this framework, 12 sequential and simultaneous games are performed in a laboratory environment to examine the effect of rewards on players' behavior. Students from Yazd universities were surveyed, and the sample of 182 students is in the form of two groups, which collected in total for 2184 ma...


A Generic Bound on Cycles in Two-Player Games

We provide a bound on the size of simultaneous best response cycles for generic finite two-player games. The bound shows that no cycle will move through the entire strategy space as long as either player has more than two strategies. This bound increases quadratically in the size of the strategy spaces. It is the tightest possible in the sense that we can construct a generic game with a cycle t...


Journal:
  • Artif. Intell.

Volume 237

Publication date 2016